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Gordonia alkanivorans strain CGMCC6845 is a halotolerant hydrocarbon-degrading bacterium isolated from petroleum- 
contaminated saline soil. Here we present the 5.0-Mb draft genome sequence of this strain, which will improve our understand- 
ing of the diversity of G. alkanivorans and the mechanisms of microbial hydrocarbon degradation in saline environment. 
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Gordonia alkanivorans can degrade diesel (1), dibenzothiophene, 
benzothiophene, and other thiophene analogs (2), which may 
have potential applications in the bioremediation of petroleum- 
contaminated soils and the biodesulfurization of oil. Therefore, hy- 
drocarbon biodegradation processes (1), biodesulfurization-related 
genes (3-5), and optimal conditions for the biodesulfurization (6) of 
G. alkanivorans have been well studied. Despite the increasing interest 
in G. alkanivorans, genomic information for this species is still lim- 
ited. To date, only one strain, G. alkanivorans NBRC16433, has been 
sequenced. Here, the draft genome sequence of the halotolerant 
hydrocarbon-degrading strain G. alkanivorans CGMCC6845 is pre- 
sented, which was isolated from petroleum-contaminated saline soil 
in China (7). 

Genomic DNA was extracted from a 3-day culture growing in 
tryptic soy broth and was sequenced using the Illumina HiSeq 
2000 platform with a whole-genome shotgun (WGS) strategy. The 
sequencing produced 8,467,694 paired-end reads with an insert 
size of 300 bp, yielding about 170-fold coverage. Filtered reads 
were assembled, scaffolded, gap filled, and validated using SOAP- 
denovo v2.04 (8), SSPACE v2.0 (9), GapFiller vl.10 (10), and 
BWA vO.7.4 (11). Final assembly consisted of 74 contigs with an 
N 50 length of 217,608 bp, which were assembled into 61 scaffolds 
with an N 50 length of 269,646 bp. Genome annotation was per- 
formed using the NCBI Prokaryotic Genome Automatic Annota- 
tion Pipeline (PGAAP) (http://www.ncbi.nlm.nih.gov/genome 
/annotation_prok/) . 

The genome consists of 5.0 Mb, with a G+C content of 
67.5%. A total of 4,476 coding sequences (CDS), 42 pseudogenes, 
1 noncoding RNA (ncRNA), 49 tRNA genes, and 3 rRNA operons 
were identified. A plasmid partitioning gene, parA, was detected 
on contig 20, which suggests the occurrence of a plasmid. We 
detected 349 tandem repeats using Tandem Repeats Finder v4.07 
(14). Neither prophage sequences nor clustered regularly inter- 
spaced short palindromic repeat (CRISPR) elements were present, 



as revealed by PHAST (12) and CRISPRFinder (13). IS3 andIS481 
families dominate the insertion sequence (IS) elements, as re- 
vealed by ISFinder (14). Average nucleotide identity (ANI) anal- 
ysis (15) revealed that G. alkanivorans CGMCC6845 is phyloge- 
netically related to G. alkanivorans NBRC16433 (98.0%). 

Nine genes were identified as involved in hydrocarbon degra- 
dation, including 3 alkanal monooxygenase genes, 2 catechol 1,2- 
dioxygenase genes, 2 ring-cleavage dioxygenase genes, 2 benzoate 
1,2-dioxygenase genes, and 1 dibenzothiophene desulfurization 
gene. Moreover, 8 genes were identified as involved in compatible 
solute synthesis and uptake, including 1 ectoine synthase gene, 2 
betaine synthase genes, 3 trehalose synthase genes, and 2 glycine/ 
betaine ABC transport genes, which may enhance the tolerance to 
osmotic stress. Copper-, arsenic-, and tellurium-resistant genes 
were detected, which may enhance the resistance to heavy metal. 
Information about the genome sequence of G. alkanivorans CG- 
MCC6845 will give a better understanding of the diversity of G. al- 
kanivorans and the mechanisms of microbial hydrocarbon degra- 
dation in saline environment. 

Nucleotide sequence accession number. The draft genome se- 
quence of G. alkanivorans CGMCC6845 has been deposited in 
GenBank under the accession number AYXO00000000. The ver- 
sion described in this paper is the first version. 
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